In the OCR Format area, users can select the desired format of the extracted/recognized text output.
OCR format could be one of the following options:
Plain Text option generates plain text output without formatting, position and font information.
hOCR option generates an XHTML formatted output file with position and layout information with HTML file extension. This option is recommended if one intend to convert the text back to formatted document. For more information about hOCR, refer to the following link: https://kba.github.io/hocr-spec/1.2/